AdaMast: A Drum Sound Recognizer based on Adaptation and Matching of Spectrogram Templates
نویسندگان
چکیده
This paper describes a template-matching-based system, called AdaMast, that detects onset times of the bass drum, snare drum, and hi-hat cymbals in polyphonic audio signals of popular songs. AdaMast uses the power spectrograms of the drum sounds as templates. However, there are two main problems in transcribing drum sounds in the presence of other sounds. The first problem is that actual drum-sound spectrograms cannot be prepared as templates beforehand for each song. The second problem is that power spectrograms of sound mixtures including the drum sound are greatly different from the template (pure drum-sound spectrogram). To solve the first problem, a template-adaptation algorithm is built into AdaMast. To solve the second problem, a distance measure used in the template matching is designed to be robust to the spectral overlapping of other sounds. The test results in Audio Drum Detection Contest were 72.8%, 70.2%, and 57.4% in transcribing the bass drums, snare drums, and hi-hat cymbals, respectively, and AdaMast won the contest.
منابع مشابه
Automatic Drum Sound Description for Real-World Music Using Template Adaptation and Matching Methods
This paper presents an automatic description system of drum sounds for real-world musical audio signals. Our system can represent onset times and names of drums by means of drum descriptors defined in the context of MPEG-7. For their automatic description, drum sounds must be identified in such polyphonic signals. The problem is that acoustic features of drum sounds vary with each musical piece...
متن کاملDrum sound identification for polyphonic music using template adaptation and matching methods
This paper describes drum sound identification for polyphonic musical audio signals. It is difficult to identify drum sounds in such signals because acoustic features of those sounds vary with each musical piece and precise templates for them cannot be prepared in advance. To solve this problem, we propose new template-adaptation and templatematching methods. The former method adapts a single s...
متن کاملDrum Detection from Polyphonic Audio via Detailed Analysis of the Time Frequency Domain
This publication presents a method for the automatic detection and classification of three distinct drum instruments in real world musical signals. The regarded instruments are kick, snare and hi-hat as agreed by the participants of the contest category Audio Drum Detection within the 2nd Annual Music Information Retrieval Evaluation eXchange (MIREX 2005). There are two challenging issues inher...
متن کاملDrum Transcription in Polyphonic Music Using Non-Negative Matrix Factorisation
We present a system that is based on the non-negative matrix factorisation (NMF) algorithm and is able to transcribe drum onset events in polyphonic music. The magnitude spectrogram representation of the input music is divided by the NMF algorithm into source spectra and corresponding time-varying gains. Each of these source components is classified as a drum instrument or non-drum sound and a ...
متن کاملUniversität Augsburg Audio Brush : Smart Audio Editing in the Spectrogram
Starting with a novel audio analysis and editing paradigm, a set of new and adaptive audio analysis and editing algorithms in the spectrogram are developed and integrated into a smart visual audio editing tool in a “what you see is what you hear” style. At the core of our algorithms and methods is a very flexible audio spectrogram that goes beyond FFT and Wavelets and supports manipulating a si...
متن کامل